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Avian myeloblastosis virus reverse transcriptase (AMV 
RT) is routinely used in the sequence analysis of RNA 
and DNA templates. We review the various methods 
for dealing with secondary structures that would other- 
wise result in premature termination or sequence com- 
pression. Based on our experience in sequencing the 
ll-kb single-stranded RNA genome of lymphocytic 
choriomeningitis virus, we have found that raising the 
reaction temperature above 47° C is the simplest way to 
overcome template secondary structure, and the use of 
98% formamide gels is the simplest way to overcome 
product secondary structure. 

We have used avian myeloblastosis virus reverse 
transcriptase (AMV RT) for direct RNA sequence 
analysis of a single-stranded RNA virus, lympho- 
cytic choriomeningitis virus (LCMV). We wanted 
an RNA sequencing method that would allow us 
to sequence several viral variants in tandem and 
that would circumvent the need to clone viral 
cDNA. 

In performing dideoxy sequence analysis with 
AMV RT we encountered two types of secondary 
structure problems: those involving template 
structures that terminate transcription, and those 
involving product (cDNA) structures that com- 
press C or G tracts of sequence. Several ap- 
proaches for overcoming template secondary 
structure have been described in the literature: 
treatment of RNA templates with 20 mM methyl 
mercury [1, 2], use of dimethyl sulfoxide in reac- 
tions with double-stranded reovirus RNA tem- 
plates [3], addition of T4 gene 32 protein (a single- 
stranded binding protein) to sequencing reactions 
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[4], and use of high reaction temperature (50°C) to 
sequence tRNA genes in chromosomal DNA [5]. 
Sequence compressions in the reaction products 
have been overcome by electrophoresis on gels 
containing 98% formaldehyde [6] or by incorpo- 
rating nucleotide analogues that are not prone to 
intrachain hydrogen bonding [7, 8]. 

Chemical methods of sequence analysis are 
less susceptible to template secondary structure 
than the enzymatic methods. Auperin et al. [9] 
end-labeled LCMV RNA and performed base- 
specific cleavage reactions [10] in order to se- 
quence the termini of the virus. We have also 
used chemical sequencing [11] on difficult regions 
of the LCMV genome, i.e., regions in which tem- 
plate secondary structure caused premature ter- 
minations with AMV RT. This involved AMV RT 
extension of an end-labeled primer, followed by 
purification of the longest cDNA products and 
chemical cleavage of these products to obtain se- 
quence [11]. Unfortunately, the template purifica- 
tion steps required for the chemical sequencing 
methods makes them much more difficult than the 
enzymatic chain terminating methods with labeled 
primers. The enzymatic method is ideal for im- 
pure templates such as whole chromosomal DNA 
or whole cellular RNA because a labeled primer 
will anneal to and extend a specific sequence. To 
obtain RNA sequence directly from either viral 
genomic RNA or infected cellular RNA we chose 
to use enzymatic chain termination with addi- 
tional denaturants in the event of problematic sec- 
ondary structure. 

Methods 

Oligonucleotide Primers 

Our reactions made use of radioactive oligonu- 
cleotides, generally 20 nucleotides in length, syn- 
thesized on an Applied Biosystems model 380A 
DNA Synthesizer. Oligonucleotides were de- 
blocked in ammonia for 10 hours at 50°C, dried in 
a spin vacuum, and resuspended in 0.5 ml distilled 
water and butanol extracted four times until 
volume was 50-100 yA. They were dried once 
again, resuspended in water, and 5 jxl (approxi- 
mately 10 A 260 U) was loaded onto a thin 20% 
acrylamide gel for preparative electrophoresis 
[12]. The oligonucleotide was visualized by 
transferring the gel on cellophane to a fluorescent 
silica gel plate (60.F-254, EM Science). The ab- 
sorbent band was excised with a razor blade and 
placed under 1 ml TE (10 mM Tris-HCl, pH 8, 1 
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mM EDTA) at 4°C to elute overnight. The next 
day the gel slice was rinsed with an additional 0.5 
ml TE, and the eluate was passed through a dis- 
posable 0.45 |xm nitrocellulose filter to remove 
acrylamide pieces. The eluted oligonucleotide 
was made 0.1 M Na-acetate, 15 jxg/ml in glycogen 
(carrier) and precipitated with three volumes of 
ethanol. The pellet was suspended in TE, and the 
oligonucleotide concentration was determined 
using an extinction coefficient of 20 (i.e., dilution 
x A260 x 20 = fxg/ml). The molar conversion of 
7000 g/mole for a 20 mer or 140 pmoles/|xg was 
used. A 20 yM oligonucleotide stock was made 
and kinased as described [11]. A typical kinase 
reaction contained 50 pmoles oligonucleotide, 5 
units kinase, 50 mM Tris-HCl, pH 7.6, 10 mM 
MgCl 2 , 5 mM dithiothreitol, 0.1 mM spermidine, 
0.1 mM EDTA, 10 |xCi 7- 32 P-ATP, and was incu- 
bated for 30 minutes at 37°C. This reaction was 
phenol extracted, then phenohchloroform (1:1) 
extracted, and then ethanol precipitated in the 
presence of 5 jxg glycogen. The pellet was resus- 
pended at 2 x 10 5 cpm/ixl, and 2.5 fxl was used per 
sequencing reaction. 



AMV RT Reaction Conditions 

The RNA template was denatured with the radio- 
active oligonucleotide at 100°C for 3 minutes and 
allowed to anneal for 30 minutes at 42°C. Reac- 
tions took place in microtiter dishes so that sev- 
eral could be performed at once using a 2 |xl re- 
peating dispenser (Hamilton PB6000): 2 jxl of an- 
nealed mix per well, 2 |xl of dilute AMV RT per 
well, and 2 jil of dideoxy nucleotide mix per well. 
Reagent concentrations were as published by 
Hamlyn et al. [15] with some modifications. We 
did not incorporate isotope during the reactions 
but instead used kinased oligonucleotide primers. 
We used higher nucleotide concentrations, but the 
same ratio of nucleotides to dideoxynucleotides 
(i.e., [dNTP]:[ddNTP] is 200 [xM:40 \lM rather 
than 50 jjlM:10 |xM). Our annealed mix is 0.2 
mg/ml in viral RNA or 5 mg/ml in total infected- 
cell RNA and 0.2 jxM in oligonucleotide. Reac- 
tions were incubated at 42°C (usually) for 30 
minutes, chased with cold nucleotides for 15 
minutes, stopped with 4 |xl/well formamide dye 
buffer, heated at 90°C for 5 minutes, and loaded 
onto thin gradient electrophoresis gels as de- 
scribed [13]. Gels were run at 40 W (bringing them 
to approximately 65°C) for 3 hours. 



Additional Denaturants 

Various denaturants were used to overcome 
problems with template and product secondary 
structure. Viral RNA at 1 mg/ml was treated with 
20 mM methyl mercury (Alpha Products, MD) for 
10 minutes at room temperature, then treated with 
0.1 mM 0-mercaptoethanol for 10 minutes at room 
temperature, then ethanol precipitated, resus- 
pended, and used directly for sequencing. Alter- 
natively, T4 gene 32 protein (Boehringer Mann- 
heim) was added at 1 |xg/ml to sequencing reac- 
tions and removed with proteinase K as described 
[4]. In other trials, template RNA was treated 
with 90% dimethyl sulfoxide, annealed in 18% di- 
methyl sulfoxide, and the reactions were carried 
out with a residual 1.8% dimethyl sulfoxide as de- 
scribed [3]. High temperature was used as a de- 
naturant without changing the conditions of our 
routine sequencing reactions as described above. 
To overcome problems with product secondary 
structure, sequencing reactions were run on thin 
polyacrylamide gels that were similar in buffer 
composition to the routine gels, but deionized 
formamide was substituted for water, resulting in 
98% formamide [6]. 

Results 

We encountered two regions of problematic sec- 
ondary structure in sequencing the single- 
stranded RNA of LCMV. Both were intergenic 
regions: the first was a 21 -base hairpin of AG = 
-47 kcal, and the other was a more difficult 
structure for which we have no simple free energy 
calculation. The first secondary structure region 
was sequenced using the usual reaction conditions 
and running the products on a denaturing form- 
amide gel as described in Methods. The more dif- 
ficult structure withstood a variety of ap- 
proaches. Treatment of the template with methyl 
mercury did not allow us to obtain sequence past 
the "knot" of premature terminations. Inclusion 
of T4 gene 32 protein in the sequencing reaction 
worked to some degree, that is, a faint but dis- 
cernible sequence could be detected beyond the 
secondary structure. This single-stranded binding 
protein was recommended for DNA sequencing at 
37°C [4] and is somewhat useful for RNA se- 
quencing at 42°C as well. Another denaturant, di- 
methyl sulfoxide, allowed us to read some se- 
quence beyond the secondary structure, but the 
quality of sequence was similar to that obtained 
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with T4 gene 32 protein. The clearest sequence 
information beyond the secondary structure was 
obtained with high temperature reaction condi- 
tions. As can be seen from Figure 1, sequence 
reactions at 42°C resulted in premature termina- 
tion by the AMV RT, whereas sequencing reac- 
tions at 47°C and 50°C, although they also had 
premature terminations, extended sufficiently 



through the region of secondary structure that a 
readable sequence ladder could be discerned. 

We were concerned that under our reaction 
conditions, the optimum temperature for nucleo- 
tide incorporation by AMV RT was no longer 
42°C. We performed incorporation assays at 37, 
42, 47, 50, and 55°C and found that optimum in- 
corporation was still 42°C, with half as much in- 
corporation at 50°C and 10% incorporation at 
55°C. Nevertheless, we were able to read a crisp 
sequence from reactions run at 47°C, 50°C (Figure 
1), and even 55°C (now shown). 

To overcome problems with product secondary 
structure, nucleotide analogues are frequently 
used. The incorporation of deoxyinosine-5 '-tri- 
phosphate (dITP), a guanine analogue, requires 
that separate ddNTP mixes be made containing a 
4:1 mixture of dITP:dGTP in place of the usual 
dGTP [14]. Similarly, 7-deaza-dGTP (7-deaza-2'- 
deoxyguanosine-5'-triphosphate) can be substi- 
tuted for dGTP at a molar ratio of 1:1 (dGTP: 7- 
deaza-dGTP) [7]. The inclusion of formamide in 
the sequencing gel also denatures product sec- 
ondary structure and prevents sequence compres- 
sion [6]. The formamide gels require more time 
and Temed for polymerization as well as more 
time to run, but are just as effective as the guanine 
analogues in solving the problem of product sec- 
ondary structure. We chose to use formamide 
gels to solve this problem rather than maintaining 
separate ddNTP mixes for these purposes, but 
this is up to the individual investigator. 



Figure 1 . Dideoxynucleotide sequence analysis of LCMV 
RNA at different reaction temperatures: 42°C, 47°C, and 50°C. 
This intergenic region of the viral RNA was problematic in that 
it caused premature termination of the AMV RT reactions as 
well as sequence compressions. 



Discussion 

The breakthrough in the use of AMV RT for se- 
quence analysis was the discovery [15, 16] that 
this enzyme has a greater affinity for the dideoxy 
analogues than does the DNA polymerase origi- 
nally used for enzymatic sequencing [17]. The 
ddNTP mixes for DNA polymerase catalyzed re- 
actions could not be used for AMV RT; instead, 
the concentration of ddNTP chain terminator in 
each mix had to be reduced at least 50-fold such 
that the ratio of dNTP:ddNTP was 5:1 rather than 
1:10. Buffer and temperature conditions had al- 
ready been optimized for the use of AMV RT in 
the production of cDNA [2, 18], so it was a rea- 
sonable advance to use an RNA-dependent poly- 
merase for chain termination sequencing. This 
method quickly surpassed the chemical se- 
quencing methods in use for RNA sequence anal- 
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ysis simply because there were fewer procedures 
involved. A second advance was the use of end- 
labeled primers for sequence analysis of impure 
templates. This meant that template purification 
steps were eliminated, and sequence could be ob- 
tained from total cellular RNA or from chromo- 
somal DNA. 

We have used direct RNA sequencing with 
7- 32 P-labeled oligonucleotide primers in a step- 
wise manner. The first oligonucleotide primer we 
used depended upon a chemical sequence deter- 
mination of the 3' end of the viral genomic RNA 
[9]. Subsequent primers were synthesized ac- 
cording to sequence obtained from the previous 
primer extension chain-termination reactions. We 
encountered two regions of extensive secondary 
structure, one was a 21-base hairpin in the region 
between the glycoprotein and nucleoprotein genes 
of LCMV, and the other was in the region be- 
tween the L protein gene and a newly discovered 
region of 500 bases at the 5' end of the genome 
(M. Salvato and E. Shimomaye, in preparation). 
The first region (a 21-base intergenic hairpin) has 
a calculated free energy of -47 kcal/mole [19] and 
was sequenced at the usual temperature (42°C) but 
the reactions were run on 98% formamide gels. 
The second region of intergenic structure proved 
to be much more difficult. Although it has a cal- 
culated free energy of only -30 kcal, it is prob- 
ably involved in tertiary base pairing such as the 
pseudoknot structures predicted for viral RNAs 
[20, 21], and sequence could only be obtained by 
the' use of denaturant chemicals or by high reac- 
tion temperatures, it was necessary to use 98% 
formamide gels to avoid the sequence compres- 
sions caused by* intrachain base stacking and 
pairing. 

The observation that AMV RT terminates 
within intergenic regions at a physiologic tempera- 
ture suggests an involvement of RNA structure in 
the regulation of viral transcription. It is possible 
that intracellular denaturants, such as the T4 
single-strand binding protein, enable transcription 
to proceed through regions of RNA higher order 
structures during viral replication but are non- 
functional during messenger transcription. 
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